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Abstract: 

The purpose of this study was to find the most accurate model for radiochromic 
film dosimetry by comparing different channel independent perturbation models. A 
model selection approach based on (algorithmic) information theory was followed, 
and the results were validated using gamma-index analysis on a set of benchmark 
test cases. Several questions were addressed: (a) whether incorporating the informa¬ 
tion of the non-irradiated film, by scanning prior to irradiation, improves the results; 
(b) whether lateral corrections are necessary when using multichannel models; (c) 
whether multichannel dosimetry produces better results than single-channel dosime¬ 
try; (d) which multichannel perturbation model provides more accurate film doses. It 
was found that scanning prior to irradiation and applying lateral corrections improved 
the accuracy of the results. For some perturbation models, increasing the number 
of color channels did not result in more accurate film doses. Employing Truncated 
Normal perturbations was found to provide better results than using Micke-Mayer 
perturbation models. Among the models being compared, the triple-channel model 
with Truncated Normal perturbations, net optical density as the response and subject 
to the application of lateral corrections was found to be the most accurate model. 
The scope of this study was circumscribed by the limits under which the models 
were tested. In this study, the films were irradiated with megavoltage radiotherapy 
beams, with doses from about 20 cGy to 600 cGy, entire (8 inch x 10 inch) films were 
scanned, the functional form of the sensitometric curves was a polynomial and the 
different lots were calibrated using the plane-based method. 
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I. INTRODUCTION 


Near water-equivalence 1,2 , high spatial resolution and weak energy dependence across a 
broad range of energies 3-8 make radiochromic film dosimetry with Gafchromic films (Ashland 
Inc., Wayne, NJ) and flatbed scanners the dosimetry system of choice for many applications 
in radiation therapy. Radiochromic films darken upon irradiation, which makes it possible 
to measure the absorbed dose using a scanner. However, when they are digitized with a 
color scanner, three different dose distributions - one for each color channel - are obtained. 
To combine the information provided by all three channels into a single and more accurate 
dose distribution, multichannel dosimetry methods have been proposed 9 16 . 

An emerging field of research in multichannel radiochromic film dosimetry are pertur¬ 
bation models 11,13,15,16 . These models consider that, for each channel, the measured dose 
distribution deviates from the true absorbed dose distribution because of small perturbations 
in the film-scanner response. To combine all three color channels, certain assumptions on 
the characteristics of the perturbations are necessary. This approach has shown promising 
results 11,13,15-17 . 

The purpose of this study is to select the most accurate model for radiochromic film 
dosimetry using Gafchromic EBT2 and EBT3 films and a flatbed scanner. In order to do 
so, different channel independent perturbation (CHIP) models are compared. This work 
examines whether incorporating the information of the non-irradiated film, by scanning 
prior to irradiation, improves the dosimetry. To what extent perturbation models correct the 
deviation from the response at the center of the scanner along the axis parallel to the scanner 
lamp is analyzed. Finally, single-, dual- and triple-channel models are compared to determine 
if by increasing the number of combined color channels the accuracy of radiochromic film 
dosimetry is also increased. 

The results of this study should be considered applicable within the limits under which 
the models were tested. Through this work, the films were irradiated with megavoltage 
radiotherapy beams, in the dose range from about 20 cGy to 600 cGy, and whole (8 inch 
x 10 inch) films were scanned. The functional form of the sensitometric curves was a 
polynomial and the different lots were calibrated using the plan-based method 14 . 
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II. METHODS AND MATERIALS 


A. A general perturbation model 

Many disturbances affect the film-scanner response 2,11,18,19 : thickness variations in the 
active layer coated on the film, electronic noise, scanner instability, lateral artifact, local 
variations produced by systematic problems of the scanner, Newton rings, dust, scratches 
or other damage, etc. According to their persistence, disturbances can be classified into 
two groups: systematic local variations and random perturbations. Random perturbations 
(he., noise) are not consistent in time: they change or disappear between scan repetitions or 
between non-irradiated and irradiated film scans. Electronic noise, scanner instability and 
Newton rings can be included within this category. Systematic variations are consistent be¬ 
tween film scans. They include thickness variations in the active layer, lateral artifact, local 
variations produced by systematic problems of the scanner, etc. Some other disturbances 
such as dust, scratches or other damage can be random or systematic, but they generally 
produce large alterations in the response, cannot be treated as perturbations, and should be 
considered as another source of noise. 

The dose absorbed by the film at point r, D(r), can be described by: 

D(r ) = D k (z k (r) + T fc (r, z k )) + E fc (r) (1) 

where k represents the color channel (he., red (R), green (G) or blue (B)), D k denotes 
the sensitometric curve (for channel k), z k (r) is the film-scanner response (either pixel 
value, optical density (OD) or net optical density (NOD)) at point r, T fc (r, z k ) corrects the 
systematic local variations and T, k (r) represents noise disturbances. 

If ty k (r,z k ) is small, we can apply a first-order Taylor expansion of D(r) in terms of 
dR(r, z k ). Hence, a general perturbation model for D(r) can be expressed as: 

D(r) = D k (z k (r )) + D k (z k (r))^ k (r, z k ) + e k (r) (2) 

where D k is the first derivative of D k with respect to z k and e k (r) is an error term that 
accounts for the difference between the true absorbed dose, D(r), and the measured dose 
after correction of the perturbation. The noise is included in the error term. 
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B. Channel independent perturbation models in the literature 

1. The Micke and Mayer models 

Micke et al 11 were the first to suggest a perturbation model. Mayer et al 13 found a 
closed-form solution to that model. They proposed a CHIP model: 

D(r) = D k {z k {r )) + D k (z k (r))A(r) + e k (r) (3) 

which implicitly assumes that the probability density functions (pdf) of all the e k {r) 
terms are equal and the pdf of A is a uniform distribution 15 . 

2. The Truncated Normal model 

In an earlier article 15 , a generalization of the Micke-Mayer (MM) model was introduced. 
The pdf of A was considered to be a truncated normal (TN) distribution and the e k (r) 
terms could be different. 

In this study, the MM and TN models were compared while different elements of the 
functional form that translates film-scanner responses into doses were varied. It should 
be pointed out that the MM model is not the same as the Micke or Mayer multichannel 
film dosimetry methods. For example, the Mayer method uses pixel value as film-scanner 
response, by contrast, in this study OD and NOD were used. The Micke method uses 
a rational form for the sensitometric curves, by contrast, in this study the sensitometric 
curves followed 


D„ = t (4) 

n =0 

where b k are fitting parameters. 

However, both the Micke and Mayer methods share the same form of the perturbation, 
which diverges from the form defined by the TN model. 

C. Scanning before and after irradiation 

The response of the dosimetry system to irradiation is expressed in terms of pixel value, 
OD or NOD. OD is defined as: 
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where Uj rr denotes the pixel values of the irradiated him and u max is the maximum possible 
pixel value of the scanner. Between pixel value and OD there is only a change of coordinates. 
For this reason, in this study pixel value was not included in the comparison. 

The information of the non-irradiated him can be incorporated in the response. NOD is 
defined as: 
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where u non i rr denotes the pixel values of the non-irradiated him. 
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D. The lateral artifact 

The lateral artifact is the deviation from the response at the center of the scanner along 
the axis parallel to the scanner lamp 14 ’ 18 ’ 20 ~ 29 . It is caused by the interplay between the light 
scattering from the polymers created in the active layer of the him and the properties of the 
scanner 30 . It is approximately parabolic in shape, with lower pixel values along the edges 
than at the center of the scan. It is dependent on the color channel and OD. The lateral 
correction is modeled empirically. In this paper, lateral corrections are calculated as 14,27 : 

Vk = a kl (x- x c ) + a k2 (x- x c ) 2 + v k (7) 

where v k is the pixel value before correction, x is the coordinate of the pixel in the axis 
parallel to the CCD array, x c is the x coordinate of the center of the scanner, v k represents 
the corrected pixel value, and a k are fitting parameters. 

Multichannel correction methods have been found to substantially mitigate the lateral 
artifact 11,31 . This raises the question of whether additional steps to correct the lateral 
artifact are necessary. If the lateral correction is considered included in the perturbation 
term, additional steps are unnecessary. An alternative approach is to explicitly apply the 
lateral correction following Eq.(7). In this study, both approaches were compared. 
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E. Single-channel dosimetry vs. multichannel dosimetry 


Multichannel dosimetry combines the doses measured with different color channels 
(-Dfc(r)) into a single dose value (d(r)). Considering D^(r) as different measurements 
of the dose, if they were uncorrelated the uncertainty of the dose d{r ) could not be higher 
than the uncertainties of each of the channels separately. However, they are correlated 15 , 
which implies that the uncertainty of d{r ) can be higher than the uncertainty of Dk(r). 
Thus, multichannel dosimetry is not necessarily better than single-channel dosimetry. It 
depends, first, on the functional form of the particular multichannel model. According to 
the functional form, it can be derived that the uncertainty of the CHIP models compared 
in this work cannot worsen when the number of combined channels increases. Second, 
this is only valid if the functional form correctly describes the physics of radiochromic film 
dosimetry. 

In this study, single-, dual- and triple-color-channel dosimetry models were compared. 


F. Model selection 

The most accurate model for radiochromic film dosimetry will be the one with the highest 
degree of coincidence between the film dose (d(r)) and the true dose ( D{r )). The degree of 
coincidence is a qualitative magnitude, in this work it was quantified as the probability of 
both being equal. 

According to Bayesian probability theory, given the measured data, the probability that 
a model is the one where the data come from is proportional to the probability of ob¬ 
taining these data from the model times the prior probability of the model: P(M\D ) cx 
P(D\M) P(M). The ideal choice for the prior is the universal weight based on the Kol¬ 
mogorov complexity of the model 32,33 . Unfortunately, the Kolmogorov complexity is not 
finitely computable. A practical complexity-based approach consists of selecting the model 
using the Akaike information criterion (AIC) 34 . Thus, the most probable model is the one 
with the lowest AIC value: AIC - 2 cm - 2 \n(P(D\M)), where cm is the number of param¬ 
eters of the model. 

Usually, when selecting a model for film dosimetry, the functional form is predefined and 
the only parameters to select fit the lateral corrections and sensitometric curves. This is 
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done for each lot of films and is referred to as ‘calibration’. In this study, the functional form 


(MM or TN, OD or NOD, etc.) had to be selected as well. Each lot of films was calibrated 
with each functional form. For each calibration, to calculate P(D\M), it was considered 
that the differences (e^) between the reference doses (D(r)) and measured doses (d(r)) were 
normally distributed. Thus, and considering that the number of parameters was negligible 
compared to the size of the data sample, the model with the lowest root mean square error 
(RMSE) comparing the reference and measured doses was regarded as the most likely model. 
Therefore, given a functional form, the parameters of the model were selected minimizing 
the RMSE. In addition, for each lot, the most probable functional form was given by the 
model with the lowest RMSE. 

Henceforth, a given use of the term ‘model’ will refer to all those models with the same 
functional form. A calibration will select the parameters that maximize the likelihood of 
that model. 

To exclude the possible dependence on a particular lot, a sample with four different lots 
was employed. Calibrations of different lots had different dose ranges and quantities of data. 
To balance their weights, relative rather than absolute differences were compared. For each 
calibration, it was assumed that the relative differences between reference doses (D(r)) and 
measured doses ( d{r )) were also normally distributed: 


D(r) = d(r) + ea{r) 


( 8 ) 


with: 



( 9 ) 


where a is referred to as the relative ‘uncertainty of the calibration’. To estimate the 
uncertainty of the measured dose, a should be combined with the uncertainty of D(r) . 

In information theory, entropy is defined as the expected negative log-likelihood of a 
random variable. Hence, maximizing the likelihood of a model is equivalent to minimizing 
the information entropy of the errors. The (differential) entropy of a model following Eq.(9) 
can be calculated as: 



( 10 ) 
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Assigning the same weight to each lot, the entropy of a model in a sample of lots is 
the mean entropy of the model in the sample. As a result, it was considered that the most 
probable model for radiochromic him dosimetry was the one with the lowest geometric mean 
of the calibration uncertainty for all lots under study. 

G. Scanning protocol 

Gafchromic EBT2 and EBT3 films with dimensions 8 inch x 10 inch were used. They 
were handled in conformity to the recommendations of the AAPM TG-55 report 2 . The films 
were scanned with an Epson Expression 10000XL flatbed scanner (Seiko Epson Corporation, 
Nagano, Japan) prior to irradiation and within the time-window following irradiation. The 
scanner was warmed up for at least 30 min before use. Before acquisitions, and after long 
pauses, five empty scans were taken to stabilize the scanner lamp. The films were centered on 
the scanner with a black opaque cardboard frame. They were scanned in portrait orientation 
(he., the short side of the him parallel to the scanner lamp and the long side parallel to the 
lamp movement axis). Scans were acquired with image-type set to 48-bit RGB (16 bit per 
channel), a resolution of 72 dpi and image processing tools turned oh. They were saved as 
TIFF hies. Five consecutive scans were taken for each him and the hrst scan was discarded 
to avoid the warm-up effect of the scanner lamp occurring with multiple scans 22,25 . 


H. Calibration 

To calibrate a lot according to the models under comparison, a set of reference doses 
should be associated with a representative sample of responses, lateral positions and per¬ 
turbations. 

To obtain a representative calibration sample, the plan-based method was chosen 14 . Films 
were placed in a Plastic Water (Computerized Imaging Reference Systems Inc. Norfolk, VA, 
USA) phantom at source-axis distance (SAD). They were irradiated with a 6 MV photon 
beam and a 60° Enhanced Dynamic Wedge held of dimensions 20x20 cm 2 at SAD. The range 
of doses delivered to the him encompassed the range of doses of interest. For some lots, the 
irradiation was repeated with several films and different monitor units (MU). In this way, 
the dose range was extended and the intralot variability mitigated. The dose distribution 


in the plane of the him was either calculated with the treatment planning system (TPS) or, 
preferably, measured simultaneously with an 1BA MatriXX Evolution ionization chamber 
array (1BA Dosimetry GmbH, Germany). Since the reference dose plane can be either 
planned or measured, from now on this method will be referred to as the ‘plane-based’ 
method. 

The plane-based method is an alternative to the well-established calibration method 
with fragments 14 . A disadvantage of the plane-based method is that the reference doses have 
higher uncertainty than with the calibration with fragments. However, better precision in the 
reference dose is useless if the calibration sample is biased. The plane-based method provides 
a more representative calibration sample, making it more robust against perturbations. 
Additionally, another advantage of the plane-based method is the efficiency, since the time 
required for calibration is considerably reduced. 

Four lots were used, two of them with EBT2 films: lot A04141003BB (Lot A) and 
lot A03171101A (Lot B); and the other two with EBT3: lot A05151201 (Lot C) and lot 
A03181301 (Lot D). They were calibrated following slightly different variations of the pro¬ 
tocol: 

The Lot A films were irradiated at a depth of 6 cm in a 12x30x30 cm 3 phantom in slab 
form with a Novalis Tx accelerator (Varian, Palo Alto, CA, USA). One film was irradiated 
with a wedge held with 660 MU (the reference doses extended from 100 cGy to 600 cGy, 
approximately). The reference dose plane was calculated with Eclipse v.10.0 (Varian Medical 
Systems) using the Anisotropic Analytical Algorithm (AAA). The films were scanned in 
reflection mode and the waiting time-window was 24 ± 1 h. 

The Lot B and Lot C films were irradiated atop the 1BA MatriXX detector inside the 
1BA MatriXX Evolution MULTICube with a Novalis Tx accelerator. Three films were used 
for the calibration, two of them were irradiated with a wedge held with 535 MU (dose range: 
75-400 cGy, approximately) and the other one with a wedge with 401 MU (dose range: 
50-300 cGy, approximately). The reference doses were measured simultaneously. The films 
were scanned in reflection and transmission mode. The time-window was 20 ± 1 h. 

The Lot D films were also irradiated and measured with 1BA MatriXX Evolution. They 
were irradiated with a Varian 2100 CD accelerator. Three films were used for the calibration 
using wedge helds with 130, 350 and 550 MU (approximate dose ranges: 20-100, 50-270 and 
80-420 cGy, respectively). The films were scanned in transmission mode and the tirne- 
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window was 24 ± 1 h. 

Whenever the reference doses were measured with the detector array, the dose values 
were scaled with a factor of 1.015 in order to correct for the distance between the plane 
of the him and the plane of measurement of the detector. When the reference doses were 
calculated with the TPS, they were corrected with the daily output of the Linac. 

The reference dose planes were exported (with a resolution of 0.49 mm/px for the planned 
doses and 7.62 mm/px for doses measured with MatriXX). They were uploaded together with 
the him scans to Radiochromic.com (http://radiochromic.com). All the dosimetry models 
analyzed in this study were incorporated in a research version based on Radiochromic.com 
vl.6. 

A total of six calibration samples (i.e., one for each lot except for Lots B and C, which had 
two samples: one scanned in reflection and another in transmission mode) were uploaded. 

Each was calibrated using MM and TN perturbations, using OD and NOD as film- 
scanner response, applying and not applying the lateral correction defined in Eq.(7) and 
employing each of the seven possible combinations of color channels (i.e., R, G, B, RG, RB, 
GB, RGB). Taking into account that the MM and TN models only apply to multichannel 
combinations, 44 different models for radiochromic him dosimetry were compared for each 
of the calibration samples. Since Radiochromic.com optimizes the calibrations using an 
evolutionary algorithm, each calibration was repeated three times and the one with the 
lowest uncertainty was selected. 

I. Validation 

Given the calibration sample, the model with the lowest calibration uncertainty was con¬ 
sidered the most accurate one. This model is expected to correctly describe the relationship 
between the hlm-scanner response and the absolute dose absorbed by the him. However, 
this expectation can be disproved when studying new data, especially if the calibration 
sample is not representative. To validate the model selection, him dose distributions were 
compared with planned dose distributions by means of global gamma-index analyses 35 . Tol¬ 
erances were set at 3%, 3 mm excluding points with less than 30% of the maximum planned 
dose. This threshold was chosen to prevent extrapolating the sensitometric curves of the 
calibrations for lower doses when using OD. To avoid noise artifacts 36 (i.e., noise in the 
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evaluation distribution which spuriously improves the gamma-index), the film and planned 
distributions were, respectively, the reference and evaluation distributions. 

Benchmark test cases described in an earlier paper 15 were employed. To allow other 
researchers to reproduce and verify the results, or to improve the present models under 
analysis, these benchmark tests are publicly available at Radiochromic.com. They were con¬ 
sidered a representative sample of the dose distributions, both for the EBT2 and EBT3 hlms. 
The EBT2 films belonged to Lot B and the EBT3 films to Lot C. The films were exposed 
with a Novalis Tx accelerator and scanned, in reflection and transmission modes, following 
the same protocol as for the calibrations. Three phantoms were used: 1BA MatriXX Evolu¬ 
tion MULTICube, C1RS Pelvic Phantom (Model 002PRA, Computerized Imaging Reference 
Systems Inc. Norfolk, VA, USA) and C1RS Thorax Phantom (Model 002LFC). They were 
set up at SAD. The films were placed atop the MatriXX Evolution detector (coronal plane) 
in the MatriXX phantom and with an offset from the beam axis in the C1RS phantoms 
(transversal plane), of 1.5 cm in the thorax phantom and 1.3 cm in the pelvic phantom, to 
diminish beam attenuation by the f ilm 37 ’ 38 . Calibration and verification tests which were 
simultaneously measured with MatriXX were compared with the TPS, obtaining a mean y<! 
(3% 3mm) of 98.9%. 

The film scans (42 tests) were converted to dose distributions applying each of the 44 
dosimetry models of the calibration. Radiochromic.com was employed for the calculations. 


III. RESULTS AND DISCUSSION 

If the model selection based on the calibration is correct, the uncertainty of the calibration 
(a) should be correlated with the percentage of points with y<i in the gamma analysis. In 
Figure 1, it is found that the correlation between a and the mean, aggregating all 42 test 
cases, 7 <i (3% 3mm) is significant. 

In this section, the 44 models are compared. Each dosimetry model comprises: per¬ 
turbation form for multichannel models (MM/TN), response (OD/NOD), lateral correction 
(applied/not applied) and the combination of color channels (R, G, B, RG, RB, GB, RGB). 
Models that only differ in one of these elements will be contrasted according to the a and 
the mean 7 ^ (3% 3mm). For the sake of clarity of the presentation, points with a larger 
than 10 % or 7 ^ less than 80% are not displayed in the figures. 
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FIG. 1: Correlation between the uncertainty of the calibration and the mean, aggregating 
all 42 test cases, y<i (3% 3mm). The Spearman’s rank correlation coefficient and p -value 

are included 


A. MM vs. TN perturbations 

Figure 2 contrasts multichannel models that only differ in the form of the perturbation: 
they use either MM or TN perturbations. The TN models were found to be better than the 
MM models in every case. This confirms the recommendation of using TN perturbations as 
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FIG. 2: Contrast of multichannel models that only differ in the form of the perturbation: 
MM vs. TN perturbations. According to a) the uncertainty of the calibration and b) the 

mean 7 <x (3% 3mm) 


opposed to MM perturbations when using CHIP models. It is due to the fact that the TN 
perturbation generalizes MM and minimizes the uncertainty in the dose inherent to CHIP 
models 15 . 


B. Lateral correction 

Figure 3 contrasts the models that apply (Ly) with those that do not apply (Ln) lateral 
corrections. Applying lateral corrections improved the results for all the cases, even for 
multichannel models. 

Recently, a new empirical formula to correct the lateral artifact has been published by 
Poppinga et al 29 . Figure 4 illustrates how this formula compares with the correction used 
in this work (Eq.(7)), which will be denoted as Absolute correction. 

Four fragments from lot A irradiated with different doses were scanned at five different 
positions along the axis parallel to the scanner lamp. A 50 x 50 px ROI with resolution 
72dpi was measured at the center of each fragment. The lateral artifacts for all three color 
channels were fitted following the Absolute correction and the Poppinga correction formulas. 
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FIG. 3: Contrast of models that apply (Ly)/do not apply (Ln) lateral corrections. 
According to a) the uncertainty of the calibration and b) the mean y<! (3% 3mm). White 

points are models using MM perturbations. 


TABLE I: Residuals fitting lateral artifacts with the Absolute and Poppinga formulas for 

all three color channels. 




Residuals (pixel value) 


Lateral correction 

R 

G 

B 

Absolute 

65.3 

33.7 

28.4 

Poppinga 

46.3 

57.9 

38.9 


In Figure 4 deviations in pixel value from the value measured at the center of the scanner are 
plotted. The maximum percentage differences between OD obtained using both correction 
methods were 0.9%, 0.3% and 0.3% for the R, G and B channels, respectively. 

To evaluate both corrections, residuals of the fits were calculated, which are shown in 
Table I. The Poppinga formula was better for the R channel, and the Absolute formula for 
the G and B channels. It was considered that both lateral corrections were equally valid for 
empirically modeling the lateral artifact present in radiochromic him dosimetry. 
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(R) (G) (B) 

FIG. 4: Film fragments scanned at different positions along the axis parallel to the 
scanner lamp for the three color channels. Deviations in pixel value from the value 
measured at the center of the scanner are shown. The colored points represent 

measurements, the dashed lines Absolute corrections, and the dotted lines and empty 
points Poppinga corrections. The shape of the point identifies the dose level of the 
fragment: 0 Gy (□), 1 Gy (o), 4 Gy (A), 6 Gy (O). 

C. Scanning before and after irradiation 

Figure 5 contrasts models that only need the information of the irradiated scan (he., use 
OD as response) with models that also need the information of the non-irradiated scan (he., 
use NOD as response). White points are models that do not apply lateral corrections or use 
MM perturbations. 

Incorporating the information of the non-irradiated scan was found to be correlated with 
lower a and higher 7 ^ (3% 3mm). 

When using NOD, dose-independent perturbations in the hlm-scanner response (e.g., 
thickness variations of non-active layers) are canceled out, dose-dependent perturbations, 
like the lateral artifact or him inhomogeneities, are reduced. However, this is valid if the 
non-irradiated and the irradiated him images are perfectly registered. Registration errors 
introduce another source of noise/uncertainty that will increase in importance the steeper the 
gradients of the perturbations are. Registration errors and pixel value noise could increase 
the noise in the him dose distributions. Even though higher noise in the reference dose 
distribution is not expected to improve the results of the gamma analysis, in contrast to 
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FIG. 5: Contrast of models using OD vs. NOD as film-scanner response, according to the 
uncertainty of the calibration, and according to the mean 7 ^ (3% 3mm). White points are 
models that do not apply lateral corrections or use MM perturbations. 


higher noise in the evaluation distribution, the effect of increasing the noise in the him dose 
distribution was studied. The dose distribution of a RapidArc prostate plan (case J from 
the benchmark tests ) 15 was calculated using the TN-NOD-Ly-RGB model (comprising TN 
perturbation, with NOD as response, applying lateral correction, and employing all three 
color channels). The him belonged to Lot C scanned in reflection mode. The noise in the 
him dose map was either smoothed using median hlters or increased with Gaussian noise. 
The percentage of points with y<i was calculated using tolerances of 3% 3mm and 2% 2mm. 
Data are shown in Table II. It was found that higher noise in the dose distribution did not 
improve the gamma results. 


D. Single-channel dosimetry vs. multichannel dosimetry 

Table III and Table IV include, respectively, the uncertainty of the calibration and the 
mean percentage of points with y<i (3% 3mm), taking into account all 42 test cases, for 
the dosimetry models under comparison. It can be observed that increasing the number of 
color channels does not necessarily result in more accurate him doses. This is conspicuous 
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TABLE II: Percentage of points with 7 <i (2% 2mm) and 7 <i (3% 3mm) for the test J of 
Lot C scanned in reflection mode after filtering the dose distribution with median Liters or 

Gaussian noise. 


Filter 

7 <i( 2 % 2mm) 

7<i(3% 3mm) 

Median filter 7x7 px 

97.2 

99.8 

Median filter 5x5 px 

97.1 

99.8 

Median filter 3x3 px 

96.9 

99.8 

No filter 

96.0 

99.6 

1% standard deviation noise 

95.1 

99.3 

2% standard deviation noise 

92.6 

98.3 

3% standard deviation noise 

88.7 

96.8 


for the MM-RG models, which have very high uncertainties and lower mean 7 <:L (3% 3mm) 
values than the corresponding R single-channel models. These high uncertainties and lower 
mean 7 <i values were caused by the sensitivity of the inherent dose uncertainty in the MM 
model with the properties of the sensitometric curves 15 ; in particular, in these cases, poor 
outcomes occurred because the sensitometric curves of both the R and G channels were very 
similar. 

From the data presented in Table III and Table IV, it is found that applying lateral 
corrections and scanning prior and following to irradiation had a larger effect on the results 
than using multichannel models. For instance, the OD-Ln-R model had 5.4% of calibration 
uncertainty and the mean percentage of points with 7 <i 3% 3mm was 86%, the NOD-Ly-R 
model had 1.8% and 96.4% of calibration uncertainty and mean y<! 3% 3mm, respectively, 
a greater improvement than the one obtained with the MM-OD-Ln-RGB (4.3%, 89.6%) or 
the TN-OD-Ln-RGB model (2.5%, 95.0%). 

Both taking into account the calibration uncertainty and the mean percentage of points 
with 7 <i (3% 3mm), the dosimetry model comprising TN perturbation, with NOD as re¬ 
sponse, applying the lateral correction, and employing all three color channels (be., the 
TN-NOD-Ly-RGB model) obtained the best results and was regarded as the most accurate 
model in the study. 
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TABLE III: Uncertainty of the calibration for the dosimetry models under comparison. 


Dosimetry model 



Color channel combination 



R 

G 

RG 

B 

RB 

GB 

RGB 

MM-OD-Ln 

5.4 

3.4 

81.7 

9.5 

6.3 

4.5 

4.3 

MM-OD-Ly 

2.0 

2.4 

20.6 

6.3 

2.7 

3.5 

2.8 

MM-NOD-Ln 

4.3 

2.5 

62.8 

5.6 

7.0 

3.6 

4.1 

MM-NOD-Ly 

1.8 

2.2 

14.7 

5.3 

2.3 

3.4 

2.4 

TN-OD-Ln 

5.4 

3.4 

3.6 

9.5 

4.3 

2.4 

2.5 

TN-OD-Ly 

2.0 

2.4 

1.9 

6.3 

1.8 

2.0 

1.6 

TN-NOD-Ln 

4.3 

2.5 

2.9 

5.6 

3.9 

2.2 

2.5 

TN-NOD-Ly 

1.8 

2.2 

1.6 

5.3 

1.6 

2.0 

1.5 


TABLE IV: Mean percentage of points, taking into account all 42 test cases, with y<i (3% 
3mm) for the dosimetry models under comparison. 


Color channel combination 


Dosimetry model 

R 

G 

RG 

B 

RB 

GB 

RGB 

MM-OD-Ln 

86.0 

85.4 

59.5 

59.2 

88.0 

80.6 

89.6 

MM-OD-Ly 

92.8 

86.9 

72.1 

64.6 

93.6 

87.6 

94.5 

MM-NOD-Ln 

89.9 

91.6 

59.6 

76.5 

84.9 

86.9 

89.0 

MM-NOD-Ly 

96.4 

92.3 

75.3 

76.2 

95.4 

88.2 

94.8 

TN-OD-Ln 

86.0 

85.4 

90.2 

59.2 

90.4 

90.7 

95.0 

TN-OD-Ly 

92.8 

86.9 

93.5 

64.6 

96.2 

91.9 

96.4 

TN-NOD-Ln 

89.9 

91.6 

93.3 

76.5 

91.2 

92.7 

95.0 

TN-NOD-Ly 

96.4 

92.3 

96.8 

76.2 

96.9 

93.3 

97.1 


The models which were among the ten best according to each of these dimensions: cali¬ 
bration uncertainty (cr), mean percentage of points with y<i 2 % 2 mm (denoted as mean 722 ) 
and 3% 3mm (mean 733 ), both gamma dimensions taking into account the 42 test cases, 
are shown in Table V. According to these criteria, there were six models. All of them use 
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TABLE V: The best models according to: calibration uncertainty (a), mean percentage of 
points with 7<x 2% 2mm (mean 722) and 3% 3mm (mean 733), both gamma dimensions 
taking into account the 42 test cases. They are sorted by mean 733. 


Dosimetry model 

*(%) 

mean y 22 (%) 

mean 733 (%) 

TN-N OD-Ly-RGB 

1.5 

88.9 

97.1 

TN-NOD-Ly-RB 

1.6 

88.1 

96.9 

TN-NOD-Ly-RG 

1.6 

88.6 

96.8 

TN-OD-Ly-RGB 

1.6 

87.2 

96.4 

NOD-Ly-R 

1.8 

87.6 

96.4 

TN-OD-Ly-RB 

1.8 

86.4 

96.2 


lateral corrections. Five of them are multichannel models, the other one is a single-channel 
(R) model. All the multichannel models use TN perturbations. Four out of six models 
incorporate the information of the non-irradiated film scan. All these models include the R 
color channel, which is the channel with the highest absorption 39 . 

E. Statistical hypothesis testing 

The model selection based on the uncertainty of the calibration using the plane-based 
method was confirmed as a convenient approach to compare film dosimetry models, in 
the sense that lower uncertainties are correlated with higher expected 7 <i values in the 
gamma analysis. Both model selection and expected gamma approaches produced the same 
conclusions. An additional validation was conducted by testing the statistical significance 
of the gamma analysis results. 

Models were compared using paired differences of 7<i(3% 3mm) values. The sample 
was composed of the 42 benchmark cases. Since the probability density functions of 7 ^ 
differences were not normal in general, the Wilcoxon signed-rank test was used. Thus, 
the null hypothesis was that the median difference between paired observations was zero. 
P -values of less than 0.05 were considered as statistically significant, i.e., indicative that 
one of the models provided higher 7<i(3% 3mm) results. There are 990 possible pairwise 
comparisons with 44 models, for clarity only the 13 comparisons considered most relevant 
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TABLE VI: Comparison of models using paired differences of 7<i(3% 3mm) values. The 
sample is composed of the 42 test cases. Median 7 ^ values are shown. Positive values 
indicate that model A obtained better results, and the opposite for negative values. 
P-values using the Wilcoxon signed-rank test are included. 


Model A 

Model B 

t£-7?i 

p -value 

OD-Ln-R 

OD-Ly-R 

-4.47 

<0.01 

OD-Ln-R 

NOD-Ln-R 

-3.05 

<0.01 

OD-Ln-R 

NOD-Ly-R 

-7.67 

<0.01 

OD-Ly-R 

NOD-Ly-R 

-1.27 

<0.01 

OD-Ly-R 

NOD-Ln-R 

1.58 

0.05 

NOD-Ln-R 

NOD-Ly-R 

-3.75 

<0.01 

NOD-Ly-R 

MM-NOD-Ly-RGB 

1.38 

0.02 

NOD-Ly-R 

TN-NOD-Ly-RGB 

-0.11 

0.02 

MM-N OD-Ln-RGB 

MM-NOD-Ly-RGB 

-2.47 

<0.01 

MM-NOD-Ly-RGB 

MM-OD-Ly-RGB 

0.22 

0.47 

MM-NOD-Ly-RGB 

TN-NOD-Ly-RGB 

-1.78 

<0.01 

TN-N OD-Ln-RGB 

TN-NOD-Ly-RGB 

-0.48 

<0.01 

TN-NOD-Ly-RGB 

TN-OD-Ly-RGB 

0.01 

0.14 


were studied. They are shown in Table VI. It was found that applying lateral corrections 
significantly improved the gamma analysis results both for single-channel (R) and triple¬ 
channel models. Using NOD as film-scanner response improved the results, significantly for 
single-channel (R) but not significantly for triple-channel models. The NOD-Ly-R model was 
found to be significantly better than the MM-NOD-Ly-RGB model, and the most accurate 
model according to the model selection approach (he., the TN-NOD-Ly-RGB model) also 
produced significantly higher 7<i(3% 3mm) values than both the MM-NOD-Ly-RGB and 
the NOD-Ly-R models. 


20 







IV. CONCLUSIONS 


Under the scope of applicability defined by the limits under which the models were tested 
(he., the films were irradiated with megavoltage radiotherapy beams, with doses in the dose 
range of 20-600 cGy, entire films were scanned, the functional form of the sensitometric 
curves was a polynomial and the calibration followed the plane-based method 14 ), it was 
confirmed that using TN perturbations provided better results than using MM perturbation 
models. It was found that applying lateral corrections produced more accurate film doses. 
The same occurred if the information of the non-irradiated him was incorporated by scanning 
prior to irradiation, however this improvement did not produce significantly higher 7<i(3% 
3mm) values with triple-channel models. Scanning prior to irradiation and applying lateral 
corrections had a greater effect in terms of improving the accuracy of the results than 
the increase of the number of combined color channels, which, for some models, did not 
necessarily improve the results. 

Among the models under comparison, the most accurate was found to be the dosimetry 
model comprising TN perturbation, with NOD as response, applying the lateral correction, 
and employing all three color channels (i.e., the TN-NOD-Ly-RGB model). 
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